Incremental Learning of Procedural Planning Knowledge in Challenging Environments
نویسندگان
چکیده
Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning methods to directly modify an agent’s execution knowledge. These systems are robust in dynamic and complex environments but generally do not support planning or the pursuit of multiple goals. In contrast, symbolic theory revision systems learn declarative planning knowledge that allows them to pursue multiple goals in large state spaces, but these approaches are generally only applicable to fully sensed, deterministic environments with no exogenous events. This research investigates the hypothesis that by limiting an agent to procedural access to symbolic planning knowledge, the agent can combine the powerful, knowledge intensive learning performance of the theory revision systems with the robust performance in complex environments of the reinforcement learners. The system, IMPROV, uses an expressive knowledge representation so that it can learn complex actions that produce conditional or sequential effects over time. By developing learning methods that only require limited procedural access to the agent's knowledge, IMPROV's learning remains tractable as the agent's knowledge is scaled to large problems. IMPROV learns to correct operator precondition and effect knowledge in complex environments that include such properties as noise, multiple agents and time-critical tasks and demonstrates a general learning method that can be easily strengthened through the addition of many different kinds of knowledge.
منابع مشابه
Learning Procedural Planning Knowledge in Complex Environments
LEARNING PROCEDURAL PLANNING KNOWLEDGE IN COMPLEX ENVIRONMENTS by Douglas John Pearson Chair: John E. Laird In complex, dynamic environments, an agent's knowledge of the environment (its domain knowledge) will rarely be complete and correct. Existing approaches to learning and correcting domain knowledge have focused on either learning procedural knowledge to directly guide execution (e.g. rein...
متن کاملToward Incremental Knowledge Correction for Agents in Complex Environments
In complex, dynamic environments, an agent's domain knowledge will rarely be complete and correct. Existing deliberate approaches to domain theory correction are signi cantly restricted in the environments where they can be used. These systems are typically not used in agent-based tasks and rely on declarative representations to support non-incremental learning. This research investigates the u...
متن کاملearning roeedura
Autonomous agents functioning in complex and rapidly changing environments can improve their task performance if they update and correct their world model over the life of the agent. Existing research on this problem can be divided into two classes. First, reinforcement learners that use weak inductive methods to directly modify an agent’s procedural execution knowledge. These systems are robus...
متن کاملA Comparison of Expert and Novice Iranian EFL Teachers’ Procedural Knowledge in Iranian Language Institutes and Universities
This study sought to compare Iranian EFL novice and expert teachers regarding their procedural knowledge in Iranian language institutes and universities. A questionnaire was developed based on the literature, the theoretical framework, and the results of a qualitative study. This questionnaire was administered to the whole sample of the study who was 200 Iranian EFL teachers from different gend...
متن کاملThe conflict between conceptual and procedural knowledge: Should we need to understand in order to be able to do, or vice versa?
The question in the title is the major foci when planning any learning environments. Basically it means that we have to understand how procedural knowledge and conceptual knowledge relate to each other. It seems appropriate to underline that these two types of knowledge must be somehow related when the learning process is our focus. However, it is the variables in the assessment of this process...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Intelligence
دوره 21 شماره
صفحات -
تاریخ انتشار 2005